Action Classification and Highlighting in Videos
نویسندگان
چکیده
Inspired by recent advances in neural machine translation, that jointly align and translate using encoder-decoder networks equipped with attention, we propose an attentionbased LSTM model for human activity recognition. Our model jointly learns to classify actions and highlight frames associated with the action, by attending to salient visual information through a jointly learned soft-attention networks. We explore attention informed by various forms of visual semantic features, including those encoding actions, objects and scenes. We qualitatively show that soft-attention can learn to effectively attend to important objects and scene information correlated with specific human actions. Further, we show that, quantitatively, our attention-based LSTM outperforms the vanilla LSTM and CNN models used by stateof-the-art methods. On a large-scale youtube video dataset, ActivityNet [4], our model outperforms competing methods in action classification.
منابع مشابه
The THUMOS challenge on action recognition for videos "in the wild"
Automatically recognizing and localizing wide ranges of human actions are crucial for video understanding. Towards this goal, the THUMOS challenge was introduced in 2013 to serve as a benchmark for action recognition. Until then, video action recognition, including THUMOS challenge, had focused primarily on the classification of pre-segmented (i.e., trimmed) videos, which is an artificial task....
متن کاملAction Change Detection in Video Based on HOG
Background and Objectives: Action recognition, as the processes of labeling an unknown action of a query video, is a challenging problem, due to the event complexity, variations in imaging conditions, and intra- and inter-individual action-variability. A number of solutions proposed to solve action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...
متن کاملSimilarity Constrained Latent Support Vector Machine: An Application to Weakly Supervised Action Classification
We present a novel algorithm for weakly supervised action classification in videos. We assume we are given training videos annotated only with action class labels. We learn a model that can classify unseen test videos, as well as localize a region of interest in the video that captures the discriminative essence of the action class. A novel Similarity Constrained Latent Support Vector Machine m...
متن کاملThe Interaction between Reflective Thinking and Grade Dropping: An Alternative Assessment Policy
The present study aimed to investigate the interaction among grade dropping and reflective thinking abilities of the participants and to also check if action research enhances learners’ reflective thinking. A cyclic action research was run for 8 sessions. Kember et al.’s (2010) reflective thinking questionnaire and three in-term quizzes were administered. Students also made questions based on t...
متن کاملVideo popularity characterization centered on news-on-demand
Video popularity characterization, in a News-on-demand service, is the core issue of this paper. It will concentrate on the digital edition of six Spanish regional newspapers, where the different news articles are classified into a wide variety of issues. Request video data has been analyzed, including how accesses were distributed along the different topics, during a period of nine months, whe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1708.09522 شماره
صفحات -
تاریخ انتشار 2017